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O ! Abstract 

<N ■ 

We consider the online stochastic matching problem proposed by Feldman et al. [3] as a 

model of display ad allocation. We are given a bipartite graph; one side of the graph corresponds 
to a fixed set of bins and the other side represents the set of possible ball types. At each time 
step, a ball is sampled independently from the given distribution and it needs to be matched 
upon its arrival to an empty bin. The goal is to maximize the number of allocations. 

We present an online algorithm for this problem with a competitive ratio of 0.702. Before our 
result, algorithms with a competitive ratio better than 1 — 1/e were known under the assumption 

^**^ ' that the expected number of arriving balls of each type is integral. A key idea of the algorithm 

is to collect statistics about the decisions of the optimum offline solution using Monte Carlo 
sampling and use those statistics to guide the decisions of the online algorithm. We also show 
that our algorithm achieves a competitive ratio of 0.705 when the rates are integral. 

On the hardness side, we prove that no online algorithm can have a competitive ratio better 
than 0.823 under the known distribution model (and henceforth under the permutation model). 

(■><-) ' This improves upon the | hardness result proved by Goel and Mehta [7J for the permutation 

t^- . model. 
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F"^ ■ 1 Introduction 

O. 

We study a natural variation of bipartite matching problem motivated in the context of online 
advertising: suppose we are given a bipartite graph G(Y, Z, E) where Y is the set of stochastic 
nodes (or ball types) and Z is the set of non-stochastic nodes (or bins). At times t = 1, 2, • • • b, a 
ball of type y 6 Y is chosen independently at random from a given distribution. The algorithm 
can assign the ball to at most one of the empty bins that are adjacent to it. Further, each bin can 
be matched to at most one ball. The goal is to maximize the expected number of non-empty bins 
at time b. We refer to this model as the known distribution model. 

When the balls are chosen by an adversary instead of a random process, Karp, Vazirani, and 
Vazirani [10J gave a simple and elegant randomized algorithm that achieves a competitive ratio of 
1 — 1/e. We present the first algorithm for this problem that improves the 1 — 1/e competitive 
ratio for the stochastic version in its general form. Previously, Feldman et al. [4] (and later [T]) 
used a very interesting combinatorial algorithm to show that this is possible when the arrival rate 
of every ball, that is the expected number of times it appears, is integral (this is also known as 
the i.i.d. model). This assumption, even though not very restrictive for the display ad allocation, 
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is somewhat unnatural. For example, when the distribution is uniform, it requires b/\Y\ to be an 
integer. 

One of the key ideas in designing our algorithm is to approximately compute the expected 
matching used by the optimum offline algorithm and use it to guide the decisions of the online 
algorithm. In particular, using Monte Carlo sampling, one can compute f(y }Z ), t ne probability that 
the optimum offline algorithm allocates a ball of type y to a bin of type z, for every y and z. 
Without loss of generality, we can assume / is a fractional matching. 

Our first algorithm writes / as a distribution over integral matchings and samples two matchings 
Mi and Mi from it. Then, in the online phase, it will use these two matchings for allocating the 
arriving balls to the bins (see Section [3]). The analysis of our algorithm is much shorter and 
simpler than both [U [I]. All these algorithms are non-adaptive, in the sense that they decide the 
allocation of all the balls regardless of the allocation of the bins before they arrive. We present 
a simple example to show that no non-adaptive algorithm can achieve a competitive ratio better 
than 1 — 1/e when the arrival rates are non-integral (see Proposition 15. ip . 

The main result of the paper is an adaptive algorithm that obtains a competitive ratio of 0.702 for 
arbitrary rates, and 0.705 for the i.i.d. model (see Section H|). Unlike the non-adaptive algorithms, 
our adaptive algorithm decides the allocation of each arriving ball based on the current allocation 
of the bins. In particular, when a ball arrives the algorithm samples two neighbor bins from a joint 
distribution and tries to match it to the first bin; if the bin is already matched the algorithm tries 
the second bin. To the best of our knowledge, this is the first algorithm that beats the 1 — 1/e 
ratio in the general form. The adaptivity of the algorithm imposes a lot of dependencies in the 
distribution of full bins and because of that our analysis is somewhat intricate. 

On the hardness side, we present an example that gives an upper bound of 0.823 on the compet- 
itive ratio of any deterministic or randomized online algorithm in the known distribution model (see 
Proposition 15. 3p . For analyzing this example, we use the expected size of a maximum matching of 
a random bipartite graph recently computed by [H EJ [5] in the context of random SAT and cuckoo 
hashing. 

1.1 Related Work 

Bipartite matching problems are central in algorithms and combinatorial optimization and arise 
naturally in several applications such as resource allocation, scheduling, and online advertising. 

The online matching problem was first studied by Karp, Vazirani, and Vazirani |10| in the 
adversarial model where the graph is unknown; when a ball arrives it reveals its incident edges. 
They proved that a simple randomized on-line algorithm achieves (1 — 1/e) and this factor is the 
best possible performance. 

More recently, Feldman et al. [4] studied the problem under stochastic assumptions. They 
assumed that the graph is known but the sequence of arrivals are i.i.d. samples from a given 
distribution. Further, they assumed that sampling rates are integral and developed an online 
algorithm that beats (1 — 1/e). They also showed that there is no 1 — o(l)-approximation algorithm 
for this setting. Recently, Bahmani and Kapralov pQ improved the upper and lower bounds of 
Feldman et al. to 0.902 and 0.699 respectively in the same setting. Also, they showed that for 
d-regular graphs, a simple randomized algorithm achieves a competitive ratio of 1 — 0(1 /yd) [Tj. 

Goel and Mehta [7] considered a different stochastic model: they assumed the graph is unknown 
but the sequence of arrivals is a random permutation. This is known as the random permutation 
model, and it is a generalization of the known distribution model. They showed that a greedy 



algorithm achieves (1 — 1/e) factor. Further, they showed that no online algorithm can achieve 
competitive ratio better than |. Since the known distribution model is a special case of the 
random permutation model, our hardness result improves their upper-bound to 0.823. Since the 
first appearance of this paper, Karande et al. [9], and Mahdian and Yan [11] independently improve 
the (1 — 1/e) competitive ratio in the random permutation model to 0.653 and 0.696 respectively. 
A close line of work to the online matching is the online 6-matching and the AdWords problem 
[121 [2]. Mehta et al. [12] developed a (1 — 1/e) online algorithm in the adversarial case. Recently, 
Devanur and Hayes [2] improved the competitive ratio to (1 — e) in the stochastic case where the 
sequence of arrivals is a random permutation or it consists of i.i.d. samples. 

2 Problem Definition 

Let G(Y, Z, E) be a bipartite graph where Y is the set of stochastic nodes (or ball types) and Z is 
the set of non-stochastic nodes (or bins). There is a rate r y associated to every type of ball y £Y. 
The online stochastic matching problem is as follows: at times t = 1, 2, • • • b, a ball of type y G Y is 
chosen independently and with probability proportional to r y . The algorithm can assign this ball 
to at most one of the empty bins that are adjacent to it; each bin can be matched to at most one 
ball. The goal of the algorithm is to maximize the expected number of non-empty bins at time b. 

Without loss of generality, we assume that ^2 yeY r y = &> thus the expected number of balls of 
type y in the sequence is r y . Also, we assume that r y < 1; if a node has a rate greater than 1, we 
can easily split it into a set of identical nodes with rates at most 1. 

We will study two classes of algorithms: non-adaptive and adaptive. A non-adaptive algorithm 
is equivalent to an ordering of the neighbors N(y) of every node y G Y. If z\, Z2, ■ ■ ■ z\N(y)\ ^ s such 
an ordering for y, then the k-th time a ball of type y arrives, the algorithm will allocate it to bin 
Zk if it is empty. If k > \N(y)\ or z^ is full then the ball will not be allocated. On the other hand, 
adaptive algorithms can choose the assignment of every ball when it arrives. 

We will compare our algorithms to the optimum offline solution. Given the sequence of arrived 
balls u = (yi,y2, ■ ■ • , 2/b)> one can compute the optimum allocation, OPT(w), in polynomial time 
by solving a maximum matching problem. Fix a particular maximum matching algorithm and let 
-F(cj) : E —> {0, 1} be the vector indicating which edges are used in the optimum allocation given 
oj. Clearly, OPT(o;) = 1 F(cj) and the competitive ratio of an online algorithm ALG is defined as 
E OPT • I 11 our case, both ALG and OPT are concentrated around their expected values, therefore 
the above competitive ratio is fairly robust (see Feldman et al. [3] for a more detailed discussion). 

Our algorithms will crucially use the optimum offline solution for making decisions. In partic- 
ular, define 

/ = 5>MP(w), (1) 

where P(u>) is the probability of the sequence u = (yi,y2,---,yb)- By definition, / is a convex 
combination of matchings and therefore it is in the convex hull of the matchings of G. We will refer 
to / as the fractional matching defined by OPT. For each edge e = (y, z) G E, f e is the probability 
that a ball of type y is allocated to bin z by the optimum offline algorithm. Similarly we define the 
fractional degree of a node to be f v = ^ e ~u fe f° r v £ Y U Z. 

Proposition 2.1 The vector f is a fractional matching in G. i.e. 

f y <r y <l, y£Y, and f s <l, z G Z. (2) 



Moreover, for e = (y, z), we have f e < 1 — e Vy + o(l/b). 

Proof: Given u), let N y (uj) be the number of balls of type y in uj. Clearly Yle~y Fe{w) < N y (u). 
Taking expectations from both sides results in the first inequality in ([2|). Similarly, the second 
inequality in ^ can be proved by noting that in any instance of the problem, z can be matched 
to at most one ball. Finally, for e = (y, z), we have 



f e < P (N v (u) > 1) = 1 - (1 - -f) b < 1 - e~ r y + o(l/b). 



□ 



Throughout the paper, we will assume that 6 is sufficiently large so that o(l/b) is negligible. 
We will need to compute f e for every edge e. Obviously, / e 's can be computed by enumeration 
in time 0(|Y| 6 ). It is also easy to see that E [OPT] and /(e) for all ee£, can be approximated 
with great accuracy using Monte Carlo method. OPT is an integral random variable which is in 
interval [0, b], hence its variance is upper-bounded by b 2 . Therefore, E [OPT] can be estimated with 
error of o(l/b), by averaging over 0(6 3 ) independent samples of the process. A similar argument 
shows that with 0(|S| 2 6 4 ) samples of uj in equation ([1]), with high probability, one can compute 
the vector / with accuracy within o(l/b\E\). In the rest of the paper, for simplicity of notation, we 
will assume that we have estimated / accurately and ignore o(-) terms. 

Since / is a fractional matching, standard algorithmic versions of Caratheodory's theorem (see 
e.g. [8, Theorem 6.5.11]) say that, in polynomial time, we can decompose a feasible solution in the 
bipartite matching polytope into a convex combination of polynomially many bipartite matchings. 
More specifically, we obtain the following: 

Corollary 2.2 It is possible to efficiently and explicitly construct (and sample from) a distribution 
(j, on the set of matchings in G such that 

J2 KM) = f e , Ve6£ 

M, eeM 

3 A Non-adaptive algorithm 

In this section, we will analyze a simple non-adaptive algorithm for the special case where all rates 
are one, i.e., r y = l,Vy G Y. This is the setting studied in Feldman et al. [4j. Our algorithm and 
its analysis is simpler and more intuitive than [3J. It also gives a slightly better competitive ratio. 

Our non-adaptive algorithm has some similarities with the online algorithm that Feldman et 
al. propose [3J. Both algorithms start by computing two matchings M\ and M2 offline; we use the 
first matching, only for the first arrived ball of each type and the second one only for the second 
arrivals. In particular, when the first ball of type y arrives it will be allocated to the bin matched 
to y in Mi, and when the second ball arrives, we will allocate it via M<i- If the corresponding bins 
are already full, the balls will be dropped. Note that the probability that there are more than two 
balls of each type y in the sequence of arrivals is very small. 

On the other hand, we use a different method from [4j to construct these matchings. Feldman 
et al. find M\ and M2 by decomposing the solution of a maximum 2-flow of G into two disjoint 
matchings (since all the rates are one, the expected graph is simply G). However, we will sample 
our matchings from the distribution [i defined by the optimum solution /. 



Algorithm 1 The Online Non- adaptive Algorithm 



Offline Phase: 

1: Compute the fractional matching /, and the distribution \x using Corollary! 

2: Sample two matchings M\ and M 2 from \x independently; set M\ (M 2 ) to be the first (second) 

priority matching. 
Online Phase: 
3: When the first ball of type y arrives, allocate it through the first priority matching, M\. 
4: Similarly, when a ball of type y arrives for the second time, allocate it through the second 

priority matching, M 2 . 

The outline of the algorithm is presented in Algorithm [H In the rest of this section, we analyze 
Algorithm [I] and show that its approximation ratio is 0.684. Let X z be the random variable 
indicating the event that bin z is matched with a ball during the run of the algorithm. We analyze 
the competitive ratio of the algorithm by comparing E [X z ] with f z : 



E[ALG] _Y, z e Z nX z ] > ^ E [X z 



E[OPT] Ezezf* ~ ^ z fz 

Consider any z G Z, and with a slight abuse of notation let M\{z) denote the stochastic node 
matched to it in M\. More precisely, if (y,z) £ M\, define M\{z) = {y}, and if z is not saturated 
in Mi, define M\(z) = 0; similarly define M 2 (z). Note that z is saturated by M\ (or M 2 ) with 
probability f z , but if M\ (z) = M 2 (z), bin z will only be used for the first arrived ball and effectively 
it is not saturated by M 2 . Given M\ and M 2 , E [X 2 |Mi, M 2 ] can be computed similar to [U section 
4.2.2] by considering the following cases: 

'0 if Mi(z) = M 2 (z) =0 

1-1/e ifMi(z)^0,{MiCz) = M 2 (z)} 

1 - 1/e if Mi (z) + 0, M 2 (z) = (3) 

1-2/e ifMi(z) = 0,M 2 (z)/0 

[ 1 - 2/e 2 if Mi (z) ^ 0, M 2 (z) / 0, M x (z) / M 2 (z) 



E[X Z \M 1 ,M 2 ] = < 



By substituting © into E [X z ] we get: 
E[X Z ] = (1 - 1/e) £ / e (l - / 2 + f e ) + (1 - 2/e) £/,(!_/,) + (1-2/e 2 ) ^ / e / e , 

= f z (2 - 3/e) - / z 2 (l + 2/e 2 - 3/e) - (1/e - 2/e 2 ) £ / 2 



I e 



The last equality can be derived by algebraic manipulation and noting that Yle^z fe = fz- It 
remains to prove a lower bound on the value of the last equation: 

Lemma 3.1 In any graph G = (Y,Z,E), if f is the corresponding vector of the optimum solution, 
we have _^ 

^H = (2 - 3/e) - (1 + 2/e 2 - 3/e)/, - (1/e - 2/e 2 ) ^ e ~ zfe > 0.684 (4) 

J Z JZ 



Proof: The proof of this lemma is mainly algebraic. Let us first fix f z and find the minimum 
of the LHS in terms of f z . For any / 2 , the LHS is minimized when X^e~2 fe 1S maximized. Note 
that Y^ e ~z fe = fz, and thus to maximize the Yl e ~z fei we ne ed to consider the most "unbalanced" 
edge probabilities that are consistent with the properties of fractional matching /. By proposition 
12.11 / e < 1 — e~ 1 for each e ~ z, thus for f z < 1 — e , the term ^2 er ^ z /f is maximized when 
we have only one edge with nonzero probability. Similarly we can show that the summation of 
the probabilities of any 2 edges incident to z is at most 1 — e -2 , thus if 1 — e _1 < f z < 1 — e~ 2 , 
the term 5^e~z fe ^ s maximized when we have two edges with nonzero probability; one edge with 
probability 1 — e _1 and one with probability f z — (1 — e _1 ). Similarly we can proceed to compute 
the maximum of Y2 e ~z fe m terms of f z for all < f z < 1. 

The only remaining task is to find the value f z that minimizes the LHS of ([!]). Intuitively, the 
LHS is minimized when f z = l. In particular, if f z < 1, we may add a dummy node j/ to 7, and 
connect it to z by an edge e = (y, z) with very small probability, i.e. f e = e. It is easy to see that 
this can only decrease the LHS. Also, one can numerically confirm that the LHS of @ attains its 
minimum at f z = 1 with value 0.684. 

□ 

Theorem 3.2 Assuming all the rates are 1, the solution of Algorithm U\ is within 0.684 °f the 
optimum offline solution. 

4 The Adaptive Algorithm 

In the analysis of the non-adaptive algorithm presented in the previous section, we assumed that 
the arrival rates of all stochastic nodes are integral and in particular, they are at least one. This is 
a crucial assumption. If the rates r y s are not bounded from below, the probability of receiving a 
second ball of the same type can become arbitrary low and the competitive ratio of the algorithm 
can get very close to 1 — 1/e. This is the case for all non-adaptive algorithms: In Proposition 15.11 
we show that no non-adaptive (even randomized) algorithm can achieve a competitive ratio better 
than 1 — 1/e when the sampling rates are not necessarily integral. 

In this section, we will analyze a simple adaptive algorithm that will have a better performance 
for arbitrary rates. The algorithm is very simple: when a ball of type y arrives, it samples two 
neighboring bins z\ and Z2 from a joint distribution. If z\ is empty then y is matched to Z\. 
Otherwise, the algorithm will try Z2 and match y to it if it is empty. 

The joint distribution from which z\ and Z2 are chosen, is determined in advance for every ball 
type y and it has the following properties: (i) The probability that z\ is equal to z is equal to f( y , z )- 
The same is true for z-i- Recall that rates are normalized such that ^2 v€ y r (v) = ^ an d thus / is a 
fractional matching, (ii) The joint distribution is such that the probability of Z\ = Z2 is minimized. 
Note that such a joint probability maximizes the possibility that a ball tries a second different bin 
in case the first bin that it tries is full. In what follows, we will present one joint distribution with 
these properties. 

Suppose (y,zi), . . . , (y,Zf-) are the edges incident to y, and without loss of generality assume 
that f( y , Zl ) > ■ • ■ > f( y ,z k )- Also define a dummy edge (y, Zk+i) that is connected to a dummy non- 
stochastic node Zfc+i, with ft y , Zh+1 ) =r y~ fy Note that f(y, Zk+1 ) 1S the probability that OPT drops 
a ball of type y. We will construct two different partitions of the interval I y = [0, r y \. Specifically, 
partitions T y and J y are defined as follows: 
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f ei = 0/ fe 2 = 0.2 






0.5 0.7 0.9 1 



H 1 I I 



21 0.2 0.4 0.5 l 

Figure 1: Illustration of partitions X y and J y for node y with edges e\, e 2 , and ej, 

• Partition X y : let 1^) = [0, f( y>Zl )]; similarly let I {y ^ t) = [Y!f=i /(!/,*,)> Ej=i /(j/,^)]> 2 < z < 
fc + 1. 

• Partition J^: let J^) = [r y - f(y iZl ),r y ], J (y ^ Z2) = [0,f( y , Z2 )], and similarly J^ zi) = 

Ej=2 /(»,*i)> Ej=2 /(v.**)]. 3 < i < fc + i. 

Note that the second partition is obtained by shifting the subintervals of Zy to the left by ft y>zi )- 
Figure [J illustrates the partitions through a simple example. Having X y and J y , the distribution 
is defined as follows: choose a number x uniformly at random from [0, r y ], define z\ y {x) to be z if 
x G I( y ,z)'i similarly define Z2, y (x) to be z' if x € J( y ,z')- It is easy to see that this joint distribution 
has property (i). Also, note that the second partition J y has the minimum possible overlap with the 
first one which implies that the resulting joint probability has property (ii), i.e., for each stochastic 
node y, the probability that Zi >y (.) = %2, y (-) is minimized. Further, if all f( y , z ) s are less than \r y , 
the probability of Z\ tV (.) = Z2, y (-) is equal to zero. 

Observation 4.1 For stochastic node y, suppose (y,z*) is the edge with the maximum probability, 
i-e- f( y ,z*) > f( y ,z), Vz ~ y. If f( y , z *) < \r y then zx, y (x) / z 2>y {x), for all x £ [0,r y \. Otherwise, 
z\, y {x) / Z2,y(x) only for x e [r y - f( y , z *), f( y , z *)}- 

The outline of the algorithm is presented in Algorithm [2j 

Algorithm 2 Online Adaptive Algorithm 
Offline Phase: 

1: Compute the fractional matching /. 

2: For each y £ Y and x G [0, r^], construct the functions Zi >y (-) and Z2, y (-) by defining the 
corresponding partitions I y and J y . 
Online Phase: 
3: If a ball of type y E.Y arrives, choose a number x uniformly at random from interval [0, r y ]. 
4: Match the ball with zi ty (x); 
5: If z\ tV {x) is full, match the ball with Z2, y {x); 



Theorem 4.2 For any graph G and arbitrary set of rates {r y , y £ Y}, the competitive ratio of 
Algorithm^ is at least 0.702. 
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Unlike Algorithm 1, the analysis of Algorithm 2 is fairly intricate, mainly because the adaptivity 
of the algorithm introduces new dependencies. We will present the proof in a few steps to build an 
intuition before getting to the actual calculations. 

Proof: Consider a non-stochastic node z & Z. Bin z can be matched as a first priority bin or as 
a second priority bin. Note that a bin will be matched once it is tried as a first or second priority. 
We define the event A z (t) to be the event that bin z was tried as a first priority bin by time t, i.e., 
at any time 1,2, ... ,t. Also, define B z {t) to be the event that bin z was tried as a second priority 
bin at time t. Using the notation defined in the previous section: 



E[X Z 



P[A z (b) V U b t=1 B z (t)) = P (Mb)) + P {u b t=iB*(t) A Mb) 

= V(A z (b)) + p(u b t=l B z (t) Mb))w{Mb)) 



(5) 



We need to compute ¥(A z (b)). Instead we compute ¥(A z (t)) for 1 < t < b; at each time step, 
the probability that a ball tries z as a first priority bin is equal to the probability that a ball of 
type y arrives, where y is connected to z through edge (y, z), and we choose a point in the interval 

I {yjZ) . This probability is ^T z S(v ' z) - & 



j/ev y 



*r, and we have: 



¥(A z (t)) 



JZ\t 

b' 



Jfz 



+ o(l/b). 



(6) 



Thus ¥ (A z (b)) = l-e-f*. The more difficult part of the analysis is to lower-bound P (u b t=1 B z (t)\ A z (b)). 



To analyze this probability, we define the parameter q z := Yly~ z f 3 



X£j(y,z)V(y,z) 



ldx. Roughly speak- 



ing, we can interpret q z as the fractional degree of z in the second priority. Note that q z < f z and 
the equality holds iff for all y ~ z, If y>z ) fl J(y }Z ) = $• I n Lemma [47] we lower-bound q z in terms of 
f z . The following lemma lower-bounds P (U b =1 B z (t)\ A z (b)) in terms of f z , q z , and the fractional 
degree of nodes at distance 2 form z. 



Lemma 4.3 For any non- stochastic node z we have: 



U b t=1 B z (t) Mb)) > -J2Y, 



'/.- 



i.y 



(x) 



dx 



■(b-t) 



t=l y^z " x £-'(y,z)\Hy,z) 

Proof: Using inclusion-exclusion principle, we have: 

b __ 

P (u b =1 B z (t)\ Mb)) > ^P(S 2 (i)|X(i))- Y, W(B z (t)nB z (u)\Mb)) 

t=l l<t<u<b 

b 



(7) 



^¥{B z (t)\Mb)) 



t=i 



1- Y, P{B z (u)\A z (b)nB z (t)) 



t<u<b 



It is sufficient to upper-bound P (B z (u)\ A z {b) n B z {t)), and to lower-bound P (B z (t)\ A z (b)) . We 
start by showing the former, the latter is proved in Lemma 14.41 

The probability that z is tried at time u conditioned on the event A z (b) is at most the probability 
that a ball of type y arrives, where y ~ z, and a number x £ J(y, z ) \ I(y,z) i s chosen. Note that since 



we are conditioning on the event that z is not tried as a first priority, the sampled point cannot 
belong to J( y . z ) n I( y>z )- By the definition of q z , the total length of the intervals J( y ,z) \ I(y,z) f° r an 
y ~ z is g 2 . 

Conditioning on the event A z (b) implies that during the run of the algorithm, no ball arrives 
for the subintervals I/y tZ ) f° r V ~ z - This condition is equivalent to reducing the rate of any such 
nodes y by f( y , z )- In other words, we choose a point in subintervals with total length of b — f z . 
Hence, the probability that z is tried at time u conditioned on event A z (b) is at most j^rr- Since 
t < u, regardless of whether the event B z (t) happens or not, the probability of B z {u) cannot exceed 
g%- (i.e. P (B z (u)\ A z (b) n B z (t)) < ■gzrf)- Since f z < 1 we can approximate this by ^ with an 
error term of o(l/b) which we ignore for simplicity. 

In lemma S3] we lower-bound P (B z {t)\ A z ibj) (see inequality Putting these together proves 
the lemma. 

□ 



Lemma 4.4 For any non- stochastic node z, and any time 1 < t < b we have: 

P (B t (t)\ AT z (b)) >\Y,f ( l ~ e--^ 1 ) dx (8) 



{y,z 



)\hv,*) 



Proof: The event B z (t) depends on whether the bins at distance 2 from z are full or not. In order 
to incorporate the effect of the allocation of these bins on the matching of z at time t, we study 
the evolution of the density of full bins at distance two from z as follows. For any edge e = (y, z) 
incident to z, define F^ y ^(t) to be those areas from J( v>z \ \ I( y , z ) whose corresponding first priority 
bin is full at time t. In other words, x E Fr yiZ \(t) if Zx, y {x) is full before time t. Also define n z (t) to 
be the sum of the length of those intervals (i.e. n z (t) = ^2 V ^ Z f xe p n\ ldx). First we show that 

P (B z (t)\ A~ z (b)) = ^^, then we lower-bound E [n z (t)}. 

First observe that the bin z will be tried at time t as a second priority iff a ball of type y ~ z 
arrives, and we choose x E Fr y>z ^(t). Thus the conditional probability that bin z is tried at time 

t as the second priority is P (B z {t)\ A z (b) A 7T 2 (t)) = pV- We illustrate this through an example. 
In the graph of Figure Q] let e\ be the only edge adjacent to z\. Suppose at time t, Z2 is full and 
Z3 is empty; we want to compute P [B zi (t)\ A Zl (b) A7r 2l (t)). We have Fiy )Zy \{t) = [0.5,0.7], and 
Tf Z i{t) = 0-2- Since z\ will be tried as a second priority only if the arriving ball is of type y and 
x E Fty jZl \, we get P (B Zl (t)\ A Zl (b) A ir Zl (t)) = gziyg - By the law of iterative expectations we 
obtain: 

f(Si(rt|ll) >!hiM (9) 

It remains to lower-bound E [7T^(t) |^4.^ (6)] . Using definition of ir z (t), we write E [7T z (i)|^4 z (6)] 



as: 



E [7T z (t)\A z (b)] = Yst E [I (x E F {yiZ) (t)) \A z (b)] dx 

= W P(xeF M (t)\A z (b))dx (10) 

»,_., J xPjr,, -,\\Ir„ ,\ 
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It suffices to lower-bound P (x E Fi yz \(t)\A z {b)j . As explained above, Fr yjZ \(t) is a non-decreasing 
random process that depends on the allocation of the bins at distance 2 from z at time t. For 
x £ J(y,z) \ I(y,z)i kt z ' = z i,v( x )' Note that x £ F^ yz ^{t) iff z' is full at time t. Thus it suffices to 
compute the probability that z' is full at time t. Observe that if z' is full at time t, it has been tried 
at least once as a first or second priority bin. Therefore, the probability of z' being full at time t 
is at least the probability of event A z '(t). For simplicity, we ignore the possibility of the trial of z' 
as a second priority and obtain the following lower bound: 



; 1 »' 



P (x £ F (ytZ) (t)\ A z {b)) > P (^ llV (*)(*)l AW) > 1 - e 

where the last inequality follows from (|5|). Substituting the RHS into (|10|) and using Q imply the 
Lemma. □ 

Putting equations ©, ©, together and using e~^ z > e _1 , we can lower bound the compet- 
itive ratio of Algorithm [2j 



E [ALG] 
E [OPT] 



E 



zez 



> 



+ e~ 



b Et=l l^y~z J 3 



x € J (y,z)\I(y,z) 



>J; 



l,y( x ) 



dx [1 



q f(b-t)] 



2-izgZ /; 



(11) 



In the rest of the proof we show that the ratio attains its minimum when the fractional degree 
of all non-stochastic nodes are exactly one, i.e., f z = 1, Vz £ Z. As a warm up, we first analyze 
this extreme case. We have: 



E [ALG] 
E [OPT] 



> 



> 1 



-e^ + e- 1 




-f (6-*)] 


-e-^ + e" 1 


t=l 






e _1 + q z e~ 2 


-e-^-e- 1 )^ 0.702, 







(12) 
1 we have q z > In 2 



where the last inequality follows from the observation that for bins with f z 
(see Lemma 14.71 for a proof). 

In the remaining parts of the proof we need to show if the fractional degree of some bins 
are much smaller than 1, still the competitive ratio of the algorithm remains larger than 0.702. 
Unfortunately, the dependencies between the fractional degree of z and bins at distance 2 from 
z result in a significant change in the probability of z being matched as a second priority. In 
particular, if all of the bins at distance 2 from z have a very small rate (i.e. if f Zl ~ -), then 

O(-). This implies that we can not lower bound the RHS of (llip by lower 



¥(iJ b t=1 B z (t)\A z (b)^ 

bounding the worst matching probability of a bin. Instead, in the following lemma we write the 
probability of z being tried as a second priority bin in terms of a linear function of f z , q z and the 
fractional degree of bins at distance 2 from z. This will enable us to lower-bound the RHS of (11 1 [1 
by a node based ratio: 
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Lemma 4.5 For any non- stochastic node z, we have: 

[u b t=1 B z (t)\ Mb)) > q z e~ l - q\ (\ - eA - e" 1 E I D " /«„(.)] **• (13) 

Proof: The proof of this lemma is mainly algebraic. First note that we can write equation ([7]) as 
P (u b t=1 B z (t)\ Mb)) > E / C(f Zly{x)j q z )dx (14) 

, t ^z ly (x) 

where C(f zi ^ x) ,q z ) : = £ Et=i( 1 ~ e * X 1 ~ T^ 3 ~ *))> is a concave function of / 21>y ( x ); this 
follows from the fact that C(., q z ) is a weighted sum of exponential functions with negative weights. 
Therefore, we can lower-bound C(-,q z ) by a linear function of f Zl r x y Since < f Zl i x \ < 1 we 
have: 

C(f Zhy{x) ,q z ) > C(0,q z ) + [C(l,q z )-C(0,q z )]f ZlAx) = C(l,q z )f Zly{x) , 

where the last equality follows by the observation that C(0, q z ) = 0. On the other hand, we have 
C(l, q z ) = e" 1 - q z {l/2 - e" 1 ). Therefore: 

C(f ZlAx) ,q z ) > (e- 1 - q z {\ - e~ l ))f Zly{x) > e" 1 - q z (l/2 - e" 1 ) - e' x [l - f Zly{x) ] 

The lemma simply follows from substituting the above equation in (|14p . and using the definition 
of q z . D 

Substituting (jT3|) in (fTTj) . we get: 

E [ALG] 

E[OPT] " Y. z ezh 

Next we rearrange the last term of the numerator to eliminate all dependencies between the frac- 
tional degree of z and the bins at distance 2 from z. This enables us to analyze the competitive 
ratio of the algorithm by the worst case ratio among all bins. We can write: 

EE / . t 1 " f^v(4 dx = E E / . t 1 - f^y(4 dx ' 

z eZ y~z Jx ^J{y,z)V( y ,z) z ^z y~z JxeI (y,z)\ J (y,z) 

Here the equality follows from the observation that for all y G Y, both sides are integrating over 
all x G [0,r y ] where Zi >y (x) ^ Z2 >y (x). Since for any x G I( y , z ) \ J ($,£)■> we have zi >y (x) = z, and 

EE / i 1 - f-^ y (4 dx = EE / t 1 - /J da; < E^ 1 - /J- 

Therefore, the competitive ratio of the algorithm is at least: 

E [ALG] . (1 - e-/«) + g z e~ 2 - <&>-!(! - e~ l ) - e~ 2 f z [1 - f z ] 

E[OPT] * ^ A • < 15 > 

Since for < q z < 1 , the RHS is an increasing function of q z , any lower-bound on q z also gives a 
lower-bound on the competitive ratio of the algorithm. In particular, if f z < ^, we can lower-bound 
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E ze z {(1 " *- f ') + ^" 2 - ql & ~\\ - e" 1 ) - e" 2 £_ J. 6J . V . [l - /*,,(»)] dx} 



q z by zero and we get „ r ^T^ n > 



i,i () [»-i-t r 7 — — > 0.719. On the other hand, if f z > ^ we use 

the lower-bound q z > In 2 + f z — 1 (see Lemma 14.71 for the proof), and we obtain that the worst 
lower-bound is attained for bins with fractional degree 1: 



E[ALG1 , n i/, x?,l i, 

_|__J > 1 - e -i + e ~ 2 In 2 - e- 1 (ln2) 2 (- - e" 1 ) > 0.702. 



This completes the proof of Theorem 14.2 



□ 



Remark 4.6 As we discussed earlier (equation (|12p ) the worst competitive ratio of the algorithm 
is attained for bins with fractional degree 1, thus the linear bounds used in the proof of Lemma \4-5\ 
does not change worst case analysis of the algorithm. 

Lemma 4.7 For any non- stochastic node z, we have q z > In 2 + f z — 1 

Proof: The proof follows from Observation 14.11 and an optimization over the sampling rate of the 
neighboring stochastic nodes. 




E° = {(y u z),(y 2 ,z)} 



E n z ={{y^z)} 



Figure 2: An example of a non-stochastic node z with \E°\ > 1. 

Let E z be the set of edges incident to z in graph G. We partition E z into two subsets E° and 
E™, such that E° consists of edges (y, z) where f< y , z ) > \r y , and E™ = E Z \E° are the rest of the 
edges. In words, E° is the set of edges e for which Ir y>z ) and J(y tZ ) overlap. For example, if the rates 
of all stochastic nodes are 1, for any edge (y,z) G E° we must have fty jZ ) > \\ but since f z < 1 we 
must have \E°\ < 1. However, this is not necessarily true if we allow the stochastic nodes to have 
arbitrary rates (see Figure [2] for an example). By Observation 14.11 we have: 



y~z x ^^(y,z)V{y,z) 



ldx 



Yl f(v,*) + Yl 



f y~ Yl f(v,*V 

y:(y,z)eE2 y:{y,z)£E° y:{y,z)€E° 



(16) 



Let /", r°, and f° be the first, second, and the third summations in the RHS, i.e., q z = f z +r z — f z . 
By Proposition 12.11 we can show f° < (1 — e~ Vz ); it is sufficient to replace all stochastic neighbors 
of z with a super node y* of rate r°, and use Proposition 12.11 to conclude that 

/° = / fov) <(l-e-^) = (l-e-^). (17) 

We can obtain a lower bound on q z simply by using the above equations and noting that f z = f z +f z '- 

Qz = f z l + r° z -f° = f z + r°-2f°>f z + r°-2(l-e- r °) 



> f z + hi2-2 + 2e 



In 2 



/ 2 + ln2-l, 
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where the first equality follows from equation (|16p . the first inequality follows from equation (pj 
and the second inequality follows from the fact that r° = In 2 is the minimizer of r° + 2e~ Tz . □ 



Corollary 4.8 If we restrict the sampling rates of all stochastic nodes to be integral (i.e. r y 's are 
integral), then the competitive ratio of Algorithm^ is at least 0.705. 

Proof: The corollary simply follows from a better lower-bound on q z in terms of f z . Since the 
rates are integral we can show q z > f z + 2e~ 1 — 1; in particular, in the proof of Lemma 14.71 assuming 
integral rates, we get r° G {0, 1} which implies that q z > f z + 1 — 2(1 — e _1 ) = f z + 2e _1 — 1. Then 
the corollary follows from plugging this lower-bound into equation (]15p . □ 



5 Upper Bounds for Online Algorithms 

We will present three examples. The first example gives a straightforward 1 — 1/e upper bound 
for the performance of non-adaptive randomized algorithms. It shows that when the rates are 
arbitrarily small, no non-adaptive algorithm can achieve a competitive ratio better than 1 — 1/e. 
Note that a randomized non-adaptive algorithm predetermines distribution T> y> i for the i-th. arrival 
of type y. In other words, when the i-th ball of type y arrives it will be matched to the neighbor 
bin z with probability P^j (z). 

Proposition 5.1 There is an instance of the online stochastic matching problem with small rates, 
r y = o(l), for which no non-adaptive randomized algorithm can achieve a competitive ratio better 
than 1 — e _1 . 

Proof: Suppose G(Y,Z,E) is a complete bipartite graph, where \Y\ = n 2 and \Z\ = n = b; also 
suppose the rate of all types is 1/n. Since G is a complete bipartite graph, OPT can easily allocate 
all the arriving balls and E [OPT] = n. On the other hand, since r y = o(l), with high probability, 
there will be at most one ball of each type. Therefore, any non-adaptive randomized algorithm 
only needs to predetermine one distribution T> y> i for each type y. For each bin z £ Z, let p z be the 
probability that an incoming ball is matched to z. In other words, 

p z = Y, r (y) p s/.i ( z ) = ~ E ^ (z) 

With probability e~ Pz no ball will be matched to the bin z in the run of the process. Thus, 
E [ALG] = ^2 ze z(^ ~ e ~ Pz )- Since function 1 — e~ x is concave we have: 

E[ALG] = i: ze z(l-^) <(1 _ e -, E ^ ) 
E[OPT] n ~ y '' 

On the other hand, we have: 

z&Z z£ZyeY y£Y z£Z 
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*(*) 



r Vl — 1 r V2 — 1 



•y'/Vv'/" 






■ ■ ■ 



|Y 2 | = n/e, / M2 = 1, / e2 = 1/n 
|Z| = n, / z = 1 



*(*) 



Figure 3: Graph structure for the proof of Proposition 



Therefore, EOPT < 1 



e 1 which completes the proof. 



D 



Our next two examples give an upper bound on the performance of any deterministic or ran- 
domized online algorithm. In the first example, the rates are integral. Our upper bound of 1 — e -2 
is slightly better than the result of pQ. 

Proposition 5.2 There exists an instance of the online stochastic matching problem with integral 
rates for which no online algorithm can achieve an expected competitive ratio better than 1 — e -2 ~ 
0.86. 

Proof: Construct a bipartite graph G(Y, Z, E), where Y = Y% U Y2, |Y"i| = \Z\ = n, and | Y2I = n/e. 
The set E of edges consists of a perfect matching between the vertices of Y\ and Z denoted by E\ , 
and a complete bipartite graph between Yi and Z, denoted by Ei. See Figure El 

First, we prove that E [OPT] = n. Given the sequence of arrivals, first we match through the 
perfect matching {E\). I n other words, we match one ball of each type y\ € Y\. Note that with 
probability e -1 , there will be no ball of type j/i, thus, in expectation, (1 — 1/e) fraction of the bins 
will remain empty after matching through E\. On the other hand, the expected number of balls of 
types Y2 is n/e, which can be matched with the n/e empty bins through the edges of the complete 
bipartite graph, E<i- Hence, this simple scheme finds the maximum matching and E [OPT] = n. 

On the other hand, consider an arbitrary online algorithm ALG; at time t, let ^(t) CZbe the 
set of full (matched) bins, and <!>(£) C Y\ be the set of types that have a neighbor in ^{t). If the 
(t + l)-st ball is of type <&(£), it is impossible for ALG to match this ball. Thus: 



|*(i + 1)| <|tf (t)| + 1 (t + I s * ball is not of type $(t)) 



Observe that, 



(t + 1 st ball is not of type #(£)) = 1 



l*(*)l 



n(l + 1/e) ' 
Note that b = n{l + 1/e) balls will arrive, thus E [ALG] = E [|*(n(l + 1/e)) 



(18) 

(19) 
Taking expectations 



from both sides of (J18j) and using (fT9j) result in: 



E [ALG] < n(l + 1/e) x (1 - 1/e) 



1 . 
V 1-^K 
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which proves the claim of the proposition. □ 

Our last and probably most interesting example is for general online algorithms, under arbitrary 
rates. In this example, we use calculations on the size of perfect matchings in random bipartite 
graphs studied earlier in the context of Random SAT and cuckoo hashing O El [5] . 

For a set Z of bins, define Y}. to be a set of (', ') vertices, each connected to a distinct subset 
of cardinality k of Z. These sets will play an important role in constructing examples with large 
competitive ratio. Let us start with a simple example. Consider an instance of online stochastic 
matching where Y = Y3, \Z\ = n. Also suppose that all the rates are equal and b = 0.9n, i.e. the 
rate of each ball r y = ra/( 3 n ). 

From the perspective of the algorithm, we will have a sequence of 0.9n arriving balls each 
connected to three bins chosen independently and uniformly at random. Because of that, all the 
empty bins are equivalent; thus the online algorithm can assign the arriving ball to any of its 
unoccupied neighbors, if there is any. Similar to the proof of Proposition 15.21 let *&(t) C Z be the 
set of full bins at time t, and <£(£) C Y be the set of types of balls that have no neighbor in Z\ \£(£) 
at time t. Note that if the (t + l)-st ball is of type $(t), it is impossible for any online algorithm 
to match it. Note that: 



(t + I st ball is not of type $(£)) = ! 



|$(*)| 1 A"® 



3 



(3) (3) 



Thus we can simply write a recurrence relation to compute the expected performance of the online 
algorithm. 

The more difficult part is to compute the optimum solution. The optimum offline algorithm 
will essentially find the maximum matching between all arrived ball types and the bins. The size 
of this maximum matching is studied by Path and Rodler [13]. There, the problem is defined as 
follows: there are b keys to be hashed into n buckets, each capable of holding a single key. Each 
key has k > 2 (distinct) associated buckets chosen uniformly at random and independently of the 
choices of other keys. A hash table can be constructed successfully if each key can be placed into 
one of its buckets. 

Define c* k to be the threshold such that if b/n < c* k and n is large enough, the resulting bipartite 
graph has a matching of size b. There has been extensive effort to compute c* k [H [5j [3]. In 
particular, it has been shown that C3 > 0.91. Therefore, we can argue that if b/\Z\ = 0.9 < C3 then 
the optimum can match all of the balls with high probability. Dietzfelbinger et al. [3] considered 
an irregular version of the cuckoo hashing, where the number of choices corresponding to a key is 
a random variable depending on the key. In particular, they considered the case where a key has 
2 choices with probability 1/2 and 3 choices with probability 1/2 (say 2.5 choices in average), and 
they defined the number c\ 5 similarly. Interestingly, they show that c\ 5 — 0.81034 which is much 
larger than c^. 

In the next proposition we use a combination of the irregular cuckoo hashing idea and the 
idea of the proof of Proposition 15.21 (adding the type Y n ) to obtain a better upper bound on the 
performance of optimal online algorithms. 

Proposition 5.3 There is an instance of the online stochastic matching problem for which no 
algorithm can achieve a competitive ratio better than 0.823. 
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Proof: Let Y = Y 2 U Y 3 U Y n , \Z\ = 
Suppose in expectation we throw m :- 
of type in Y n . Therefore, we have b = 



n; note that Y n and Z form a complete bipartite graph. 
l/2c2 5 n balls of types in I2, m of types in Y3 and n — 2m 



n. 



and r y = m/Q) for y G Yj}, 



y 



/?? 



/(£) for y G y 3 , and 



r y = n — 2m for y G Y n . The optimum offline solution would first match the balls of types in Y% 



and Y3, and because the expected number of these balls is at most c% 5 n, it can match all of them 
with high probability. Then, it matches all the balls of type Y n to the unoccupied bins. Therefore 
E [OPT] = n. Let ALG be an online algorithm and let VP(i) and $(i) be defined as above. Similar 
to the equation (120j) we can compute the probability that an incoming ball can be matched by 
ALG. Note that if a ball of types in Y n arrives the online algorithm can always match it through 
the complete graph; on the other hand, if a ball of type Y2 or Y3 arrives it can only be matched if 
it has at least one neighbor in Z\SSf(t). Note that: 



(the type of t + I s ' ball is not in $(£)) = 1 



m 

n 



f\9(t)\\ /|*(t)|\ 

/r»\ 



(2) 



Therefore, we have 



E[|*(t + l)|] < E[|*(i)|] + 1 



< EN* 



CT 1 ) + (!V) 




where the last inequality follows from Jensen's inequality. One can numerically compute E [|*(n)|] 
and show that E [|*(n)|] < 0.823n for n > 1000. Thus for n > 1000, we have: 

E [ALG] < E [|*(n)|] < 0.823n, 

which implies that the approximation ratio of the online algorithm is at most 0.823. □ 



6 Discussion 

We should also point out that competitive analysis is not the only possible or necessarily the most 
suitable approach for this problem. Because the distribution from which the input is generated is 
known, one can use dynamic programming (or enumeration of future events) to derive the optimal 
allocation policy. Unfortunately, the dynamic programming approach takes exponential time. In 
fact, one can show that the problem of computing the optimal allocation policy in NP-hard. We 
leave it as an open problem whether it is possible to come up with a polynomial-time algorithm 
with an approximation guarantee that is better than the best possible competitive ratio for this 
problem or the competitive ratio that we obtain here. 
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